Monte-Carlo Tree Search

نویسنده

  • G. Weiss
چکیده

representation of the game. It was programmed in LISP. Further use of abstraction was also studied by Friedenbach (1980). The combination of search, heuristics, and expert systems led to the best programs in the eighties. At the end of the eighties a new type of Go programs emerged. These programs made an intensive use of pattern recognition. This approach was discussed in detail by Boon (1990). In the following years, different AI techniques, such as Reinforcement Learning (Schraudolph, Dayan, and Sejnowski, 1993), Monte Carlo (Brügmann, 1993), and Neural Networks (Richards, Moriarty, and Miikkulainen, 1998), were tested in Go. However, programs applying these techniques were not able to surpass the level of the best programs. The combination of search, heuristics, expert systems, and pattern recognition remained the winning methodology. Brügmann (1993) proposed to use Monte-Carlo evaluations as an alternative technique for Computer Go. His idea did not got many followers in the 1990s. In the following decade, Bouzy and Helmstetter (2003) and Bouzy (2006) combined Monte-Carlo evaluations and search in Indigo. The program won three bronze medals at the Olympiads of 2004, 2005, and 2006. Their pioneering research inspired the development of Monte-Carlo Tree Search (MCTS) (Coulom, 2006; Kocsis and Szepesvári, 2006; Chaslot et al., 2006a). Since 2007, MCTS programs are dominating the Computer Go field. MCTS will be explained in the next chapter. 2.6 Go Programs MANGO and MOGO In this subsection, we briefly describe the Go programs MANGO and MOGO that we use for the experiments in the thesis. Their performance in various tournaments is discussed as well.4

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Monte-Carlo Hex

We present YOPT a program that plays Hex using Monte-Carlo tree search. We describe heuristics that improve simulations and tree search. We also address the combination of Monte-Carlo tree search with virtual connection search.

متن کامل

Efficient Sampling Method for Monte Carlo Tree Search Problem

We consider Monte Carlo tree search problem, a variant of Min-Max tree search problem where the score of each leaf is the expectation of some Bernoulli variables and not explicitly given but can be estimated through (random) playouts. The goal of this problem is, given a game tree and an oracle that returns an outcome of a playout, to find a child node of the root which attains an approximate m...

متن کامل

A New Method for Parallel Monte Carlo Tree Search

In recent years there has been much interest in the Monte Carlo tree search algorithm, a new, adaptive, randomized optimization algorithm. In fields as diverse as Artificial Intelligence, Operations Research, and High Energy Physics, research has established that Monte Carlo tree search can find good solutions without domain dependent heuristics. However, practice shows that reaching high perfo...

متن کامل

Monte-Carlo Approximation of Temperature

Monte-Carlo tree search is a powerful paradigm for the game of Go. We propose to use Monte-Carlo tree search to approximate the temperature of a game, using the mean result of the playouts. Experimental results on the sum of five 7x7 Go games show that it improves much on a global search algorithm.

متن کامل

Revisiting Monte-Carlo Tree Search on a Normal Form Game: NoGo

We revisit Monte-Carlo Tree Search on a recent game, termed NoGo. Our goal is to check if known results in Computer-Go and various other games are general enough for being applied directly on a new game. We also test if the known limitations of Monte-Carlo Tree Search also hold in this case and which improvements of Monte-Carlo Tree Search are necessary for good performance and which have a min...

متن کامل

Cooperative Games with Monte Carlo Tree Search

Monte Carlo Tree Search approach with Pareto optimality and pocket algorithm is used to solve and optimize the multi-objective constraint-based staff scheduling problem. The proposed approach has a two-stage selection strategy and the experimental results show that the approach is able to produce solutions for cooperative games.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010